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AutO-DeCOTvolutincr Combinatorial Libraries 

The present invention relates to the field of apparatus 
and methods which provide the rapid generation of 
structure/activity relationships using auto-deconvoluting 
5 combinatorial libraries, which facilitate the invention of 
novel active compounds . The invention provides apparatus 
and methods which can be used for the rapid generation of 
structure/activity relationship (SAR) data, and, 
therefore, the characterisation of the active motif of any 
io group of compounds. 

The invention provides libraries of compounds which 
interact with an active moiety, and apparatus and methods 
to identify such compounds. The active moieties may be 
(but are not limited to) enzymes, receptors, antibodies, 
etc. The interaction of the active moiety with the 
compounds of the library may be (but is not limited to) 
the interaction of a substrate or inhibitor with an 
enzyme, the interaction of a ligand with a receptor, the 
interaction of an antigen or antigenic epitope with an 
antibody, etc. 

The following list illustrates a number of enzyme classes 
and subclasses: 

Enzvme Clagflfrp 

1 . Oxidoreductase 
25 a) dehydrogenase 

b) oxidase 

c) peroxidase 



15 
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d) catalase 

2 . Hydrolase 

f) peptidase (proteolytic enzyme) 



3 . Transferase 
5 g) aminotransferase 

h) kinase 

i) glucosyl transferase 



4 . Lyase 

j ) decarboxylase 
io k) dehydra tease 

5. Isomerase 
1) racemase 
m) mutase 



6 . Ligase 
15 n) synthetase 

o) carboxylase 



Proteases of interest include (but are not limited to) : 



1. Aspartyl proteases, such as renin, HIV, cathepsin D 
and cathespin E etc . 



20 2. Metalloproteases, such as ECE, gelatinase A and B, 
collagenases, stromolysins etc. 

3. Cysteinyl proteases, such as apopain, ICI, DerPI, 
cathepsin B, cathepsin K etc. 
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4. Serine proteases, such as thrombin, factor Vila, 
factor Xa, elastase, trypsin. 

5. Threonyl proteases, such as proteasome S. 
Proteases 

Many therapeutically useful drugs act as enzyme 
inhibitors. In particular, proteolytic enzyme inhibitors 
have been the focus of much attention in the 
pharmaceutical industry, because they play a variety of 
roles in a multitude of biological systems. Their 
proteolytic activities are related to processes ranging 
from cell invasion associated with metastatic cancer to 
evasion of an immune response, as seen in certain 
parasitic organisms; from nutrition to intracellular 
signalling to the site-specific proteolysis of viral 
proteases and eukaryotic hormone-processing enzymes. 
However, the traditional random screening methods for the 
identification of lead molecules as inhibitors of 
proteolytic enzymes are often laborious and time- 
consuming. Therefore new and efficient methods which can 
accelerate the drug discovery process are greatly in 
demand . 

We consider that proteases contain an active catalytic 
site which tends to become increasingly activated as the 
recognition pockets 1 {S 1 and S 2 etc) and (Sj' and S 2 ' etc) 
become better occupied. Therefore, it is important that 
those parts (P, and P 2 etc) (P^ and P 2 ' etc) of the 
inhibitor that best fit into these pockets are identified 
as quickly as possible in order to design novel protease 



10 
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inhibitors. Therefore, we have devised a combinatorial 
method for the rapid identification of these binding 
motifs which. will greatly expedite the synthesis of 
inhibitors of a variety of proteolytic enzymes such as 
5 aspartyl proteases, serine proteases, metallo proteases 
and cysteinyl proteases. 

The use of a fluorescence resonance energy transfer (FRET) 
substrate for the analysis of proteolytic enzyme 
specificity was first published by Carmel. 2 Since then 
many different quenched fluorogenic substrates for 
measuring enzyme inhibition have been described in the 
literature. 4 * 11 These substrates contain a fluorophore, F, 
in a P position (vide supra) , which is quenched by another 
group, Q, present in a P' position (vide supra) and 
separated from F by the scissile bond. The advantage of 
the positioning of these residues, F and Q, is that 
cleavage of a peptide bond occurs between the two natural 
residues and, therefore, represents a more natural 
hydrolytic event rather than the cleavage and release of a 
C- terminal chromophore. 

For example, Bratovanova and Petkov 12 have synthesised 
fluorogenic substrates from peptide 4-nitroanilides . N- 
acylation of peptide 4-nitroanilides with the aminobenzoyl 
(ABz) group yielded substrates that are internally 
25 quenched by the presence of the 4-nitroanilide moiety. 

Upon hydrolysis of the aminoacyl-4-nitroanilide bond, the 
highly fluorescent N-ABz group is released attached either 
to an amino acid or peptide. 



15 
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Immobilised libraries, where substrates are attached to a 
polymer or biopolymer support, have also been used for 
mapping protease binding sites. 11 Singh et al. reported 
recently that enzymatic substrate activity of 38 selected 
5 octapeptides attached via a linker to controlled pore 
glass is predictive of the same activity of similar 
peptides in solution. However, these results are 
preliminary and only for a specific example. Therefore, it 
is not clear whether immobilised substrates attached to 

10 polymers can reliably replace soluble substrates in 

mapping the hindered protease binding sites, especially 
since the hydrophilic or lipophilic nature of the polymer 
and the size of the interstices within the polymer are 
bound to influence the reaction between the enzyme and its 

is substrates. 

Mixtures of internally quenched, fluorogenic substrates 
have also recently been described in which the quencher 
group, Q, is 2,4-dinitrophenyl (Dnp) and is attached to 
the P side of the scissile bond, while the fluorogenic 
20 group, is N-methyl anthranilic acid (Nma) and is attached 
to the P' side. 14 

Examples of other Donor -Acceptor Chromophore Pairs that 
have been applied to Biological Systems are shown in Table 



25 



Table 1 : Donor-Acceptor Chromophore Pairs That Have Been Applied To 
Biological Samples 



Donor 



Acceptor 



Donor 



Acceptor 



Naphthalene 
IANBD 
30 IAEDANS 



Dansyl 
DDPM 
DDPM 



IAEDANS 

e-A 

NBD 



TNP-ATP 

IANBD 

SRH 
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DNSM 


LY 


ISA 


TNP 




IAEDANS 


1ANBD 


Dansyl 


ODR 




E-A 


FjDNB 


DANZ 


IAF 




Pyrcne 


Bimane 


FNAI 


EITC 


5 


ANAI 


IPM 


NBD 


LRH 




1AANS 


IAF 


IAF 


EIA 




e-A 


F 2 DPS 


FITC 


ENAI 




e-A 


DDPM 


Proflavin 


ETSC 




IAEDANS 


TNP 


CPM 


TNP-ATP 


10 


MNA 


DACM 


IAEDANS 


IAF 




PM 


NBD 


CPM 


Fluorescein 




FITC 


TNP-ATP 


IAEDANS 


FITC 




DANZ 


DABM 


FITC 


TMR 




NCP 


CPM 


IAF 


TMR 


15 


NAA 


DNP 


CF 


TR 




LY 


TNP-ATP 


CPM 


FTS 




IAF 


diI-C lB 


e-A 


TNP-ATP 




IAF 


TMR 


CPM 


FM 




FMA 


FMA 


LY 


EM 


20 


PM 


DMAMS 


FITC 


EITC 




mBBR 


FITC 


IAEDANS 


DiO-C l4 




mBBR 


DABM 


IAF 


ErITC 




e-A 


NBD 


FITC 


EM 




Pyrene 


Coumarin 


FITC 


ETSC 


25 


IPM 


FNAI 


FITC 


ErITC 




IAEDANS 


DABM 


BPE 


CY5 



ANAI, 2-anthracene N-acetylimidazole; BPE, B-phycoerythrin; CF, carboxyfluorescein 
succinimidyl ester; CPM, 7-doethy!amino-3-(4 , n^leimidylphenyl)-4-rnethylcournarin; CY5, 
caii>oxymethylindocyaiiine-N-hydroxysuccinimidyl ester, diI-C u , l,r-dioctadecyl-3,3,3'3'- 

30 tetramethyl-indocarbocyanine; DiO-C, 4 , 3,3'-ditetradecyloxacarbocyanine; DABM, 4- 

dimethylaniniphenylazo-phenyM'-maleunide; DACM, (7-(dimethylamino)coiiimrin-4-yl)- 
acetyl; DANZ, dansylaziridine; DDPM, N-(4-dimethylarnino-3-5-dinitrophenyl)imleimide; 
DACM, di-methylamino-4-maleimidostilbene; DMSM, N-(2,5-dimethoxystiben-4-yl)- 
maleimide; DNP, 2,4-dinitrophenyl; e-A, l,N 6 -ethenoadenosine;EIA, 5-(iodoacetetamido)eosin; 

35 EITC, eosin-5-isothiocyanate; ENAI, eosin N-acetylimidazole; EM, eosin maleimide; ErITC, 

erythrosin-5*-isothiocyanate; ETSC, eosin thiosemicarbazide; F 2 DPB, 1,5-diiluoro- 
l^'dinitrobenzene; F 2 DPS, 4,4'^ifluro-3,3'dmitropheylsuiphone; FITC, fluorescein N- 
acetylimidazole; FTS, fluorescein thiosemicarbazide; IAANS; 2-((4*- 
iodoacetarnido)anilino)naphthalene-6-sulphonic acid; IAEDANS, 5-(2- 

4 o ((iodoacety l)amino)ethy Iamino)-naphthlene- 1 -sulphonic acid; IAF, 5- 

iodoacetamidofluorescein; IANBD, N-((2-(iodoacetoxy)ethyl)-N-methyl)amino-7-nitrobenz-2- 
oxal,3,diazole; IPM, 3(4-isothiocyanatophenyl)7-dicthyl-4-methylcoumarin; ISA, 4- 
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(iodoacctamido)salicylic acid;LRH, lissaminerho-2,l f 3-benzoxadia2ol-4-yl; NCP, N- 
cyclohcxyl-N'-{l-pyrenyl)carbodiimide; ODR, octadecylrhodaminc; PM, N.(l-pyrene)- 
maleimide; SRHsuIphurhodaminc; TMR, tctramethylrhodamine; TNP, trinitrophenyl; TR, 
Texas red 

5 from: Wu, P. and Brand; L. 1994. Anal. Biochem. 218, 1-13. 

The specificity of soluble peptide libraries have been 
determined. 1516 Berman et al. described" an HPLC mass 
spectrometry technique in which 6 mixtures of 128 peptides 
were synthesised which were N-terminally labelled with the 

o Dnp group in order to allow UV monitoring on the HPLC. The 
disadvantage of this approach is that each assay mixture 
has to be individually analysed, because no fluorogenic 
substrate is revealed, and that the effective 
concentration of each separate component is limited by the 

5 size of the mixture because of overall solubility factors. 

Drevin et al. 17 have suggested the use of individually 
synthesised fluorogenic substrates for the determination 
of enzyme activity using a chromophore which chelates 
lanthanide ions. Garmann and Phillips have suggested the 
» use of FRET substrates in which the fluorogenic and 
quencher moieties are attached via thiol or amino 
functional groups after the peptide has been synthesised, 
but this has the disadvantage that they are not in library 
form and that these functional amino and thiol groups need 
to be selectively revealed after the peptide has been 
synthesised. Wang et al . have suggested the use of the 
EDANS and DABCYL fluorescor and quencher pairing for the 
individual synthesis of substrates for proteolytic 
enzymes . 
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The above methods which have used FRET techniques for the 
mapping of the active site around a specific protease 
suffer from one or more of the following disadvantages: 



i. because of general aqueous insolubility they do not 

5 produce mixtures of compounds in a form suitable for 

high throughput screening in aqueous solution. 

ii . the derivatised compounds cannot be prepared in 
combinatorial library form using solid phase 
techniques . 

10 iii. the mixtures which have been used 8 - 9 were not self- 
decoding, and needed time-consuming deconvolutive 
resynthesis for identification of the active 
molecules . 



Kinases 

is Protein kinases are intracellular enzymes that play key 
roles in cell growth, differentiation and inter-cell 
communication. Aberrant protein kinase activity has been 
implicated in many disease states including several forms 
of cancer and severe -combined immunodeficiency disease. 

20 All serine/ threonine and tyrosine protein kinases have a 
region of approximately 300 amino acids known as the 
catalytic subunit which has evolved from a common ancestor 
kinase (Hunter et al, 1991). Crystal structure 
determination of several kinases has shown that they all 

25 have a common bi-lobal structure. The amino- terminal part 
of the subunit encodes a small lobe responsible for the 
binding of ATP, whereas the carboxy- terminal residues 
encode a larger lobe important for protein substrate 
binding. In the tertiary structure of the active kinase, 

30 both the ATP and the protein substrate binding sites are 
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brought together allowing transfer of the ATP y-phosphate 
to the amino acid acceptor on the protein substrate. The 
protein/peptide binding groove stretches across the face 
of the large lobe between two a-helices and under the 
s small lobe. This groove therefore contains the residues 
important for substrate specificity of the kinase. 
Protein kinases are arranged in kinase cascades within the 
cell, providing the ability for signal amplification in 
post -transduction pathways. This amplification relies on 

io the upstream kinase specifically activating its downstream 
partner. For this reason, protein kinases have developed 
remarkable substrate specificities which prevent unwanted 
crosstalk between different kinase cascades. We believe 
that such substrate specificity can be exploited in the 

is design of selective protein kinase inhibitors. 
Existing protein kinase lflhibi£Q£§ 
Non-pent idft inhibits™ 

Most available non-peptide protein kinase inhibitors do 
not target the substrate binding region, but compete with 

20 ATP for enzyme binding. These inhibitors can be 

engineered using conventional medicinal chemistry to 
demonstrate specific enzyme selectivity. An example of 
this strategy was the modification of the fungal 
metabolite staurosporine, a general protein kinase 

25 inhibitor, to the bis-indolyl-maleimides Ro 31-7549 and Ro 
31-8425 (fig. 17) which are selective protein kinase C 
inhibitors (Muid et al, 1991). Unfortunately, the potency 
of ATP antagonist inhibitors can be dramatically reduced 
in certain intracellular compartments due to the high 

30 (millimolar) levels of ATP. 
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The aromatic compound erbstatin (fig. 18) produced by- 
certain Streptomyces strains was identified in screens for 
reversal of protein tyrosine kinase -induced cell 
transformation. Erbstatin was subsequently shown to 
s inhibit the EGFR kinase in vitro with a k t of 5.58 /M and 
typical lineweaver-Burke competitive kinetics against a 
peptide substrate, while remaining non- competitive with 
ATP (Umezawa and Imoto, 1991) . The compound is readily 
inactivated in calf serum and so could not be considered 

10 as a serious candidate for therapeutic use. Attempts have 
been made to both increase the stability of erbstatin, by 
replacement of the N-formyl moiety, and bias the 
inhibition towards specific tyrosine kinases using a 
systematic modification of all parts of the molecule. 

is These synthetic erbstatin derivatives are known as 

tyrphostins (tyrosine £ho£phoryla£ion inhibitors) and are 
mainly used as tools in enzyme/signal transduction 
research, but have had limited success in animal models of 
disease (see Levitzki and Gazit, 1995 for an extensive 

20 review) . 

Pseudosubstrate inhibitors 

Research into the structure and function of protein 
kinases uncovered a natural method of regulating catalytic 
activity known as pseudosubstrate inhibition. This method 

25 uses sequences, either on the same molecule as the 

catalytic site (protein kinase C) or on a regulatory 
subunit molecule (cyclic cAMP-dependent protein kinase) 
that bind into the active site and inhibit kinase 
activity. These sequences do not have a phospho- acceptor 

30 residue, but modification of a single specific residue to 
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a phospho-acceptor results in the formation of a good 
substrate sequence. It is for this reason that the 
sequences are known as pseudosubstrates. Peptides based 
on pseudosubstrate motifs have been used as tools for 
inhibition of specific proteins kinases such as cyclic 
cAMP -dependent protein kinase (PKA) , protein kinase C 
(PKC) , myosin light chain kinase (MLCK) and calmodulin- 
dependent protein kinase II (Kemp et al, 1994) . 
Pseudosubstrate peptides of less than 20 amino acids have 
been shown to be potent low nanomolar inhibitors in vitro 
with pseudosubstrate-based peptides of six amino acids 
inhibiting protein kinases at less than one micromolar 
(Kemp et al, 1991) . 

It is clear that there is scope for a small molecular 
weight inhibitor of protein kinases that competes with 
substrate for binding to the active site. A rational and 
peptide proven approach would be to base these inhibitors 
on substrate molecules which can specifically access the 
catalytic site. By focusing on pseudosubstrates , this 
approach would also have the advantage of "in-built" 
specificity, an element which is generally missing when 
leads are identified from random screening approaches. 

Pyoteiyi k^age inhibitors in disease treating t 

Currently, there are no specific protein kinase inhibitors 
available for clinical use. Nevertheless, the potential 
for such compounds is clear with every major 
pharmaceutical company having projects for kinase 
inhibition in a number of therapeutic areas, most notably 
oncology, autoimmunity and transplant rejection. 
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Prototypic kinase inhibitor compounds are widely available 
from many companies, and are used as tools for in vitro 
signal transduction research. Several have formed the 
basis for extensive medicinal chemistry programmes 
5 designed to engineer both selectivity and enhanced 

activity into these compounds. A number of examples have 
been reported which indicate that this strategy has had 
some success in producing compounds active both in vitro 
within cell assay systems, and in animal models of 

10 disease. Staurosporine analogues (CGP41251 and UCN-01) 

have been used to show anti -tumour activity in vivo (Meyer 
et al, 1989; Akinagaka et al, 1991) and selective PKC 
inhibitors have been shown to work in animal models of 
acute inflammation (Bradshaw et al, 1993) . Moreover, the 

15 BMN tyrphostin AG490 has been shown to inhibit 

lymphoblastic leukaemia growth in animal models, 
correlating with its ability to inhibit the non-receptor 
intracellular tyrosine kinase JAK-2 (Meydan et al, 1996) . 
These promising preclinical studies have yet to be 

20 translated into the clinic, and it is possible that 

unforeseen toxicological problems, perhaps related to 
compound specificity, could be delaying their progress. 

During the past few years, there has been an explosion of 
knowledge relating to kinase structure, function and 

25 activity. New kinases have also been identified, some of 

which have a sufficiently restricted cellular distribution 
that one could envisage that a specific inhibitor would 
have selective effects e.g. a ZAP70 inhibitor would 
selectively inhibit activation of T cells and, to a lesser 

30 extent, NK cells. Together, these aspects offer the 
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promise of new approaches for the development of kinase 
inhibitors to treat a range of conditions. 

Method Of Protein Kinase Inhibitor n^cH^n 

generation of initial substrate information 

5 A technique has rec^ntly^been developed by L. Cantley's 

laboratory to provide, consensus peptide kinase substrate 
information (Songyang et al , 1^94).. Firstly, a degenerate 
peptide library of peptides with a central phospho- 
acceptor such as tyrosine or serine/threonine flanked by 

10 four unknown amino acids on each side is synthesised. The 
library is then phosphorylated by the protein kinase of 
interest and phosphorylated peptides isolated by DEAE- 
sephacel and ferric chelation chromatography. The 
phosphopeptide mixture is then sequenced and the frequency 

15 of each amino acid at every position -4 to +4 assessed to 
give a preferred substrate sequence. These studies have 
yielded consensus substrate information, and do not allow 
a detailed analysis of particular preferences for 
neighbouring residue interactions. Similarly, the analysis 

20 was performed solely with natural amino acids. 

Filamentous phage are now also being used to generate 
substrate information (Miami, Biotechnology Conference, 
. 1996) . Substrate information can also be obtained from 
knowledge of the physiological substrates for the kinase 

25 in vivo. 

Description of the Drawings 
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Figures 1 to 14 inclusive exemplify component 
distributions in the plates of a library matrix; 

Figure 15 illustrates a reaction scheme for production of 
compound number 4: Boc-Val-Ala-Leu-H wherein 

5 i. isobutylchloroformate, N-methylmorpholine, then N,0- 
dimethylhydroxylamine HC1, THF. 

ii. HC1, dioxan. iii. isobutylchloroformate, N- 
methylmorphline, then Boc-Ala-OH, THF. 

iv. HC1, dioxan. V. Boc-Val-Osuc, N-methylmorpholine, 
10 DMF. Vi. LAH; 

Figure 16 illustrates a reaction scheme for production of 
active inhibitors of Der PI. • 

Figure 17 shows the molecular structure of the Roche 
protein kinase C inhibitors. A; Ro 31-8425/002, 3-[8(RS)- 
15 (aminomethyl) -6, 7, 8, 9-tetrahydropryridol [1, 2-a] indol-10- 
yl] -4- { 1 -methyl- 3 -indolyl) -1H -pyrrole -2, 5-dione 
hydrochloride. B; Ro 31-7549/001, 3- [1- (3-aminopropyl) -3- 
indolyl] -4- (1 -methyl -3 -indolyl) -lH-pyrrole-2 , 5-dione 
hydrochloride . 

20 Figure 18 shows the molecular structure of the protein 
tyrosine kinase inhibitor erbstatin, isolated from 
Streptomyces MH435-hF3. 

Brief Description of the Invention 
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The present invention relates to the field of apparatus 
and methods which provide the rapid generation of 
structure-activity relationships using auto-deconvoluting 
combinatorial libraries, which facilitate the invention of 
5 novel active compounds. 

We describe herein apparatus and methods which can be used 
for the rapid generation of structure-activity 
relationship (SAR) data and, therefore, the 
characterisation of the active motif of any group of 
10 compounds . 

The invention provides two orthogonal sets of mixtures of 
compounds in solution providing two complementary 
combinatorial libraries indexed in two dimensions for 
autodeconvolution. These are referred to as primary and 
15 secondary libraries. 

The general concept of two orthogonal sets of mixtures 
indexed in two dimensions can be applied to various 
permutations of numbers of wells, plate layout, number of 
permutations per mixture etc. However, according to the 
20 invention the numerical interrelationship is defined as 
indicated below for libraries containing compounds with 
four variable groups B,C,D and E. 

General Deconvolution Formulae 

-Bb-Cc-Dd-n(Ee) - (I) 
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1) Primary and Secondary plates preferably have the same 
number of compounds per well [X] : otherwise there are 
two values, having X p and X s respectively. 

2) The primary library comprises [np] plates. 

5 If Rp.Cp=Rs.Cs, then the number of plates in the secondary 
library is also [np] . If not, the number of plates in the 
secondary library [ns] is: 

ns = Rp.Cp . np 



Rs .Cs 



10 



eg. A primary library of np=4, Rp=8, Cp=10 can be set out 
in an Rs=4, Cs=5 secondary library with the number of 
plates equal to: 



ns - 8 x 10 



np 



4x5 



15 



* 16 plates. 



Number of compounds per well 



-Bb-Cc-Dd-np (Ee) - 



(1) 



Number of possible combinations [k] is given by: 
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k = b.c.d.np.e (2) 

When number of wells on a plate = [N] , number of compounds 
per well = [X] and number of plates = [np] : 

k = X.N.np (3) 

5 However, number of wells [N] is also defined by the number 
of rows [Rp] and number of columns [Cp] : 

N = Rp. Cp (4) 
Combining (3) and (4): 
k = X.Rp.Cp.np (5) 
10 Combining (2) and (5) : 

b.c.d.np.e = X.Rp.Cp.np (6) 

Cancelling [np] from both sides of the equation : 

b.c.d.e = X.Rp.Cp (7) 

Two of the variables (e.g. b.c) on the left side of the 
is equation must each be equal in number to the number of 

columns [Cp] , whilst a remaining variable (e.g. d) on the 
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left side must be equal in number to the number of rows 
[Rp] . So: 

[Cp] 2 . Rp.e = X.Rp.Cp (8) 

Cancelling [Cp] and [Rp] from both sides of the equation: 
5 Cp.e = X (9) 

where [e] is the number of variants along a fixed row; and 

if Rp=Cp, then Rp.e = X. 

Example 

for a 10 x 10 x 8 x 8 format over 4 plates: 
10 np.e = 8 => e = 2 
10 x 2 = X 
X = 20. 
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From an understanding of the general deconvolution 
formulae shown above, those skilled in the art will 
readily appreciate that the advantageous results of self- 
deconvolution according to the invention are obtainable 
utilising a number of different arrangements of wells, 
plate layouts, mixtures etc and that such variants on the 
preferred embodiment illustrated herein are intended to be 
within the scope of the present invention. 



10 



The molecules may be cyclic or alicyclic. They may be 
linear or cyclic on the same structure 



B-C-D-E 



or 



B C 



E D 



or BCDE can be on a central scaffold {linear or cyclic) 



15 



1 1 1 r- 

B C D E or 




or one of the residues can be the backbone 



SUBSTITUTE SHEET (RULE 26) 



WO 97/42216 



PCT/GB97/01158 



Where the symbol -B-C-D-E- or -B-C-D-nE- is used herein, 
it is to be understood to include all of these linear and 
cyclic variants within its definition. 



At least one but not all the bonds between B, C, D, E need 
to be scissile. Non scissile bonds may include 
sulphonamide , urea , aminomethylene . 



10 Several non-limiting examples of "scaffold" molecules are 
shown in Table 2, in which substituents Ri-R, correspond to 
possible variable groups B, C, D and E. 



In a first aspect the invention provides novel compounds 
represented by the formula A-B-C-D-nE-F [I] in which; 
15 A represents a fluorescor internally quenched by F; 

B, C, D, and E represent groups such that the scissile 
bond between any two of these groups is a suitable bond; 

F represents a quencher capable of internally quenching 
the fluorescor A; and 

20 n represents an integer between 1 and 4 inclusive. 
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In some embodiments the suitable bond is an unsubstituted 
amide bond (see Example 1) ; in other embodiments the 
suitable bond is an ester bond (see Example 2) . 

In preferred embodiments B, C, D, E are amino acids or 
hydroxy acids. That is a molecule with an amine or 
hydroxy terminus and a carboxylic acid terminus. The 
amine/hydroxy group may be positioned on the same carbon 
atom or separated by a number of atoms and atom types. 

In a second aspect the invention provides a combinatorial 
library of FRET compounds comprising a mixture of 
compounds of formula [I] . 



15 



20 



25 



In a third aspect the invention provides for the use of 
such a combinatorial FRET library in a method which 
provides rapid generation of structure-activity 
relationships (SAR) which comprises detection and 
measurement of proteolytic enzyme activity by carrying out 
an assay with a library of combinatorial FRET 
(fluorescence resonance energy transfer) molecules to find 
a substrate or substrates for the enzyme. According to 
this method an identified substrate can be synthesised and 
used in biological assay for proteolytic enzymes. Novel 
substrates are included in the scope of the invention. 

In a forth aspect the invention provides for the use of 
such a combinatorial FRET library in a method for 
detection and measurement of proteolytic enzyme activity 
against compounds of the library. 
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In a fifth aspect the invention provides a method which 
comprises the identification of an enzyme inhibitor or 
inhibitors wherein a FRET compound which has been 
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identified as a substrate is used in an inhibition assay 
with the enzyme separately against a panel of possible 
inhibitors. ■ 

In a sixth aspect the invention provides a set of 
compounds which comprises two complementary FRET compound 
libraries. Such a set will be referred to hereafter as 
Apparatus" because it allows for the screening or assay 
method for identifying substrates or inhibitors of 
proteolytic enzymes. This set of compounds constituting an 
apparatus is capable of providing an auto-deconvo luting 
combinatorial library as will be described below. 

In a seventh aspect the invention provides a method of 
identifying and synthesising an inhibitor of a proteolytic 
enzyme which comprises detection and measurement of 
proteolytic enzyme activity by carrying out an assay with 
a library of combinatorial FRET (fluorescence resonance 
energy transfer) molecules, deconvoluting the library to 
find a substrate or substrates for the enzyme and 
synthesis of an inhibitor based on the substrate or 
substrates. The direct product of this method is one or 
more novel proteolytic enzyme inhibitors. 

In an eighth aspect the invention provides an inhibition 
assay which uses a FRET molecule, which has been 
identified as a substrate for the enzyme, wherein the 
molecule is assayed with the enzyme separately against a 
panel of possible inhibitors. 
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In a ninth aspect the invention provides a complementary 
pair of compound libraries LI and L2 which constitute a 
set containing compounds of formula: 



Aa-Bb-Cc-Dd-n(Ee) -Ff-Gg 



giving axbxcxdxexfxg = Mn compounds in each 
library, there being a predetermined number (Pi, P2) of 
mixtures each consisting of a predetermined number (Ql, 
Q2) of individual identifiable compounds in each library, 
wherein both LI and L2 contain the same Mn compounds, but 
wherein any two compounds which are found together in one 
mixture of Ql compounds of LI are not found together in 
any one of the P2 mixtures of L2 . 



In a tenth aspect the invention provides a method of 
screening for enzymic activity using the libraries LI, L2 
described above in which the PI mixtures of LI and the P2 
mixtures of L2 are each placed separately into individual 
wells of well plates, the well plates having wells 
arranged in a format adapted to allow deduction of a 
unique active compound formula from the presence of 
activity in one well of LI and one well of L2. 

The apparatus of the invention preferably comprises two 
complementary compound libraries, LI and L2, each 
containing n x 1600 compounds of the invention, of the 
type A-B^ -Cuo-D^.-nCEx.aJ-F-G [II], in which: 
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A = a fluorescor internally quenched by F, preferably an 
unsubstituted or substituted anthranilic acid derivative, 
connected by an amide bond to B 

B, C, D, E, are natural or unnatural amino acid residues 

5 connected together by suitable bonds, although B, C, D and 
E can be any set of groups, provided that the scissile 
bond between D-E is an unsubstituted bond. 

F = a quencher capable of internally quenching the 
fluorescor A, preferably an unsubstituted or substituted 
10 3 -nitro tyrosine derivative. 

G = optionally present and is a hydrophilic moiety, 
preferably an aspartyl amide moiety. If present, G 
advantageously ensures that all compounds in the library 
are imparted with aqueous solubility. Also, G should not 
is be a substrate for any type of enzyme. 

n = any integer between 1 and 4 inclusive. 

In an alternative, the scissile bond could be between B-C 
or C-D. 

(Note that A and F herein correspond generally and 
20 respectively to moieties F and Q of the prior art referred 
to above) . 

The numbers represented in subscript following residues B, 

C, D and E refer to the number of possibilities from which 
those residues are selected. Thus, by way of illustrative 

25 example, A-Bj^-C-D-E^j-F-G represents a mixture of the 
following ten compounds: 
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A-B.-C-D-Ei-F-G 
A-B 2 -C-D-Ei-F-G 
A-Bj-C-D-E^F-G 
A-B^C-D-E^F-G 
5 A-Bg-C-D-E^F-G 
A-B^C-D-E.-F-G 
A-Bi-C-D-Ea-F-G 
A-B 5 -C-D-E 2 -F-G 
A-B 4 -C-D-E 2 -F-G 
10 A-B 5 -C-D-E 2 -F-G 

The general combinatorial formula for each library can be 
expressed as: 

Ai-B.o-C^-Da-nfEJ -F.-G, [III] 

providing Ixl0xl0x8xnx2xlxl= 1600n 
is compounds . 

Both compound libraries, LI and L2, of the above type are 
synthesized using solid phase techniques using the 
Multipin approach' 4 such that each library contains 1600n 
compounds as 80n mixtures of 20 distinct, identifiable 
20 compounds. These 20 component mixtures are then placed 

separately into each of 80 wells of a 96 well plate (the 
other two lanes are used for control experiments) and then 
screened against a known quantity of the protease. 

Thus it is an important part of the invention that 
is regardless of the number of compounds contained in the two 
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libraries LI and L2 (e.g. in the preferred embodiment: 
1600n, where n = any integer between 1 and 4) the 
libraries themselves are complementary and amenable to 
deconvolution without recourse to resynthesis. It is also 
5 an important part of the invention that the library matrix 
has been especially formatted so that the most important 
site pairings P 2 and Pj for proteolytic enzymes can be 
identified immediately without recourse to resynthesis. 

Those compounds of the type A-B-C-D-E-F-G that are the 
10 better substrates for the protease will be cleaved, and 

can be readily identified because the fluorescor, A, will 
be cleaved from its nearby quencher F, in a time dependent 
manner which can be easily quantified. The fluorescent 
quenching by F of A only occurs when the two are in nearby 
15 proximity, normally within 30 angstrom units. Hence 

cleavage of a scissile bond (e.g. the scissile bond D-E) 
allows F to move further away from A and thus allow A to 
fluoresce when excited by light of the correct wavelength. 



1 . In this manner the most active compound can be 
20 rapidly identified without the need for further 

resynthesis and deconvolution. Moreover, the wells 
that show the most rapid development of fluorescence 
can also be analysed by mass spectrometry, since by 
comparison with the original mixture, the identity of 
25 the most efficient substrate can be found by its 

disappearance into its two component parts, e.g A-B- 
C-D and E-F-G. 
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Hence the problem of library deconvolution can be overcome 
and the most active substrate for the enzyme can be 
rapidly identified. 

In addition, after the initial treatment of the 
proteolytic enzyme with the library mixtures, LI and L2, 
the residual enzymatic activity in each well can be 
quantified by the addition of the most potent fluorogenic 
substrate for the enzyme, SI, which is found in the 16xn 
compound library. Because of the nature of the library 
design this can be quickly prepared and purified. If there 
is no appearance of increased fluorescence with the known 
substrate, SI, then the presence of an enzyme inhibitor 
can be inferred, which again can be quickly identified 
without the need for resynthesis. 

The general description of the library layout will now be 
described with reference to figures 1 to 14 . 

For example, when n=l and the library contains 1600 
compounds, in the first column of the first row (Al) 
(Fig.l) in the first plate (PI) of the library LI, 
(hereinafter designated as location A1,P1,L1) there will 
be one C component, C l# one D component, D 1# the ten B 
components and the two E components (E x and E 2 ) (Fig. 2) . 
In the tenth column of the first row (A10) in the first 
plate (PI) of the library LI, (hei&Lnaf ter designated as 
location A10,P1,L1) there will be one C component, C 10 , one 
D component, D lf the ten B components and the two E 
components (E l and E 2 ) . In the tenth column of the eighth 
row (H10) in the first plate (PI) of the library LI, 
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(hereinafter designated as location H10,P1,L1) there will 
be one C component, C 10 , one D component, D e , the ten B 
components and the two E components (E x and E 2 ) . Hence all 
1600 components are present in the one plate, because the 
5 80 wells each contain 20 components . 

A second complementary library is synthesised as follows 
(Fig. 3) . In the first column of the first row (Al) of the 
first plate (PI) of the library, L2, (hereinafter 
designated as location A1,P1,L2), there will be ten C 

10 components, two D components (D x and D 2 ) , one B component, 
B, , and one E component , E, . In the tenth column of the 
first row (A10) of the first plate (PI) of the library, 
L2, (hereinafter designated as location A10 , PI , L2 ) , there 
will be ten C components, two D components (Di and D 3 ) , one 

15 B component, B l0 , and one E component, E x . In the first 

column of the second row (Bl) of the first plate (PI) of 
the library, L2, (hereinafter designated as location 
B1,P1,L2), there will be ten C components, two D 
components (Dj and D 2 ) , one B component, B p and one E 

20 component, E 2 . In the tenth column of the second row (B10) 
of the first plate (PI) of the library, L2, (B10,P1,L2) 
there will be ten C components, two D components (D. and 
D 2 ) , one B component, B 10 , and one E component, E 2 . Hence 
only the first two rows are used to accommodate 400 

25 compounds in total . 

In the first column of the first row (Al) of the second 
plate (P2) of the library, L2, (hereinafter designated as 
location A1,P2,L2), there will be ten C components, two D 
components (D 3 and D 4 ) , one B component, B x/ and one E 
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component, E x {Fig. 4) . In the tenth column of the first 
row <A10) of the second plate (P2) of the library, L2, 
(hereinafter designated as location A10,P2,L2), there will 
be ten C components, two D components (D 3 and DJ , one B 
component, B 10 , and one E component, E x . In the first 
column of the second row (Bl) of the second plate (P2) of 
the library, L2, (hereinafter designated as location 
B1,P2,L2), there will be ten C components, two D 
components (D 3 and DJ , one B component, B lt and one E 
component, E 2 . In the tenth column of the second row (BIO) 
of the second plate (P2) of the library, L2, (B10,P2,L2), 
there will be ten C components, two D components (D 3 and 
D J , one B component, B 10/ and one E component, E 2 . Hence 
only the first two rows are used to accommodate 400 
compounds in total. 



In the first column of the first row (Al) of the third 
plate (P3) of the library, L2, (hereinafter designated as 
location A1,P3,L2), there will be ten C components, two D 
components (D 5 and DJ , one B component, B w and one E 
component, E 1 (Fig 5). in the tenth column of the first row 
(A10) of the third plate (P3) of the library, L2, 
(hereinafter designated as location A10,P3,L2), there will 
be ten C components, two D components (D s and D J , one B 
component, B l0/ and one E component, E x . In the first 
column of the second row (Bl) of the third plate (P3) of 
the library, L2, (hereinafter designated as location 
B1,P3,L2), there will be ten C components, two D 
components (D 5 and D 6 ), one B component, B w and one E 
component, E 3 . In the tenth column of the second row (BIO) 
of the third plate (P3) of the library, L2, (B10,P3,L2), 
there will be ten C components, two D components (D 5 and 
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D 6 ) , one B component, B 10 , and one E component, E 2 . Hence 
only the first two rows are used to accommodate 400 
compounds in total . 

In the first column of the first row (Al) of the fourth 
5 plate <P4) of the library, L2, (hereinafter designated as 
location A1,P4,L2), there will be ten C components, two D 
components (D 7 and D e ) , one B component, B 1# and one E 
component, E x (Fig. 6) . In the tenth column of the first 
row (A10) of the fourth plate (P4) of the library, L2, 

10 (hereinafter designated as location A10,P4,L2), there will 
be ten C components, two D components (D 7 and D 8 ) , one B 
component, B 10 , and one E component, Ej . In the first 
column of the second row (Bl) of the fourth plate (P4) of 
the library, L2, (hereinafter designated as location 

15 B1,P4,L2), there will be ten C components, two D 

components (D 7 and D 8 ) , one B component, B lf and one E 
component, E 2 . In the tenth column of the second row (BIO) 
of the fourth plate (P4) of the library, L2, (B10,P4,L2), 
there will be ten C components, two D components (D 7 and 

20 D 8 ) , one B component, B 10 , and one E component, E 2 . Hence 
only the first two rows are used to accommodate 400 
compounds in total. 

In this fashion two complementary libraries, LI and L2 are 
prepared. In library, LI, each of the 80 of wells 
25 contains a mixture of 20 components providing 1600 

compounds for screening. In library, L2, four plates are 
used in which only the first two rows are employed, 
providing 20 wells of 20 components per well per plate, 
and furnishing the same 1600 compounds as are present in 
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library LI, but in a format in which no two compounds 
found together in library, LI, will be found together in 
library, L2.. 



Thus it is an important part of the invention that the 
5 compounds contained in the two libraries LI and L2 are 

themselves complementary, in that any two compounds which 
are found together in a 20 component mixture in the same 
location (e.g. A1P1L1) in library LI, are not found 
together in any of the 20 component mixtures in any 
io location of the library L2. 

Thus, for example, with reference to the primary library 
PI LI of figure 2 and the secondary libraries PI L2, P2 , 
L2, P3 L2 and P4 L2 of figures 3-6 it is possible to 
deconvolute an exemplary sequence: 



15 



-B 2 -C 3 -D 4 -E 3 - 

If this sequence is a substrate fluorescence will occur in 
PI LI at C 3 D 4 . This gives the information that the 
substrate is 



-?-C 3 -D 4 -?- 

20 If fluorescence occurs in P2 L2 at BA it indicates a 
substrate 
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The confirmation of the substrate as 

should be provided by non-f luoreac^rp of PI L2, P3 L2 and 
P4 L2 which all contain -B 2 -C 3 -X-E a - where X is not D 4 . 

5 In practice it is likely that more than one sequence will 
result in a substrate7^"lnf ormation as to which positions 
B-C-D-E- are sensitive to change (i.e. require a specific 
group) and which are insensitive (i.e. can tolerate more 
than one choice of group) in the context of the whole 
10 sequence gives valuable SAR data which can be used to 
model and/ or synthesise related compounds. 

/ 

In analogous examples/ where separately n=2, 3 or 4, extra 
plates are constructed in library LI format to accommodate 
the component pairs E 3 and E« (n =2) , E 5 and E 6 (n = 3), and 
is E 7 and E e (n = 4) , respectively. For the respective 

deconvolution libraries of the type, L2, the respective 
rows in the plates PI, P2, P3, and P4, are increasingly 
filled with the pairedycomponents D x and D 2 , D 3 and D 4 , and 
D s and D 6/ and D 7 and D 8 , respectively. 

20 ' For example, when n = 3, and the library contains 4800 
compounds, in the first column of the first row (Al) in 
the first plate (PI) of the library LI, (hereinafter 
designated as location A1,P1,L1) there will be one C 
component, C w one D component, D lf the ten B components 

25 and the two E components (E x and E 2 ) . In the tenth column 
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of the first row (A10) in the first plate (PI) of the 
library LI, (hereinafter designated as location A10,P1,L1) 
there will be one C component, C 10 , one D component, D lf 
the ten B components and the two E components ^ and E 3 ) . 
In the tenth column of the eighth row (H10) in the first 
plate (PI) of the library LI, (hereinafter designated as 
location H10,P1,L1) there will be one C component, C 10 , one 
D component, D a , the ten B components and the two E 
components (E x and E 2 ) . Hence 1600 components are present 
in the one plate, because the 80 wells each contain 20 
components . 



In the first column of the first row (Al) in the second 
plate (P2) of the library LI, (hereinafter designated as 
location A1,P2,L1) there will be one C component, C x# one 
D component, D w the ten B components and the two E 
components (E 3 and EJ . In the tenth column of the first 
row (A10) in the second plate (P2) of the library LI, 
(hereinafter designated as location A10,P2,L1) there will 
be one C component, C I0 , one D component, D lf the ten B 
components and the two E components (E 3 and E 4 ) . In the 
tenth column of the eighth row (H10) in the second plate 
(P2) of the library LI, (hereinafter designated as 
location H10,P1,L1) there will be one C component, C 10 , on 
D component, D B , the ten B components and the two E 
components (E 3 and E 4 ) . Hence 1600 components are present 
in the one plate, because the 80 wells each contain 2 0 
components . 

In the first column of the first row (Al) in the third 
plate (P3) of the library LI, (hereinafter designated as 
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10 



location A1,P3,L1) there will be one C component, C x , one 
D component, D lt the ten B components and the two E 
components (E s and E 6 ) . In the tenth column of the first 
row (A10) in the third plate (P3) of the library LI, 
(hereinafter designated as location A10,P3,L1) there will 
be one C component, C 10 , one D component, D l§ the ten B 
components and the two E components (E s and EJ . In the 
tenth column of the eighth row (H10) in the third plate 
(P3) of the library LI, (hereinafter designated as 
location H10,P3,L1) there will be one C component, C 10 , one 
C component, C 8 , the ten B components and the two E 
components (E 5 and E 6 ) . Hence 1600 components are present 
in the one plate, because the 80 wells each contain 2 0 
components. In total the three plate, PI, P2 and P3, 
is contain 1600 compounds /plate 4800 compounds in total. 

For example, when n = 4, and the library contains 6400 
compounds, in the first column of the first row (Al) in 
the first plate (PI) of the library LI, (hereinafter 
designated as location A1,P1,L1) there will be one C 

20 component, C 1# one D component, D lt the ten B components 

and the two E components (Ej and E 2 ) (Fig. 7) . In the tenth 
column of the first row (A10) in the first plate (PI) of 
the library LI, (hereinafter designated as location 
A10,P1,L1) there will be one C component, C 10f one D 

25 component, D 1# the ten B components and the two E 

components (E x and E 2 ) . In the tenth column of the eighth 
row (H10) in the first plate (PI) of the library LI, 
(hereinafter designated as location H10,P1,L1) there will 
be one C component, C l0 , one D component, D 8 , the ten B 

30 components and the two E components (E L and E 7 ) . Hence all 
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1600 components are present in the one plate, because the 
80 wells each contain 20 components. 



In the first column of the first row (Al) in the second 
plate (P2) of the library LI, (hereinafter designated as 
location A1,P2,L1) there will be one C component, C,, one 
D component, D lt the ten B components and the two E 
components (E, and E 4 ) (Fig. 8). In the tenth column of the 
first row (A10) in the second plate (P2) of the library 
LI, (hereinafter designated as location A10,P2,L1) there 
will be one C component, C 10 , one D component, D lf the ten 
B components and the two E components (E 3 and E 4 ) . in the 
tenth column of the eighth row (H10) in the second plate 
(P2) of the library LI, (hereinafter designated as 
location H10,P2,L1) there will be one C component, C l0 , one 
15 D component, D,, the ten B components and the two E 
components (E, and EJ . 



10 



20 



25 



In the first column of the first row (Al) in the third 
plate (P3) of the library LI, (hereinafter designated as 
location A1,P3,L1) there will be one C component, C lf one 
D component, D,, the ten B components and the two E 
components (E s and E t ) (Fig. 9) . In the tenth column of the 
first row (A10) in the third plate (P3) of the library LI, 
(hereinafter designated as location A10,P3,L1) there will 
be one C component, C 10 , one D component, D,, the ten B 
components and the two E components (E s and EJ . in the 
tenth column of the eighth row (H10) in the third plate 
(P3) of the library LI, (hereinafter designated as 
location H10,P3,L1) there will be one C component, C l0 , one 
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D component, D e , the ten B components and the two E 
components (E s and E 6 ) . 

In the first column of the first row (Al) in the fourth 
plate (P4) of the library LI, (hereinafter designated as 
s location A1,P4,L1) there will be one C component, C w one 
D component, D w the ten B components and the two E 
components (E 7 and E e ) (Fig. 10). Likewise, in the tenth 
column of the first row (A10) in the fourth plate (P4) of 
the library LI, (hereinafter designated as location 

10 A10,P4,L1) there will be one C component, C 10 , one D 
component, D l# the ten B components and the two E 
components (E 7 and E 8 ) . In the tenth column of the eighth 
row (H10) in the fourth plate (P4) of the library LI, 
(hereinafter designated as location H10,P4,L1) there will 

is be one C component, C 10 , one D component, D 8 , the ten B 
components and the two E components (E 7 and E 8 ) . 

A second complementary library is synthesised as follows. 
In the first column of the first row (Al) of the first 
plate (PI) of the library, L2, (hereinafter designated as 

20 location A1,P1,L2), there will be ten C components, two D 
components (D x and D 2 ) , one B component, B w and one E 
component, E x (Fig. 11) . In the tenth column of the first 
row (A10) of the first plate (PI) of the library, L2, 
(hereinafter designated as location A10,P1,L2), there will 

25 be the ten C components, two D components (D x and D 2 ) , one 
B component, B i0 , and one E component, Ej . In the first 
column of the eighth row (HI) of the first plate (PI) of 
the library, L2 , (hereinafter designated as location 
H1,P1,L2), there will be the ten C components, two D 
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components (D, and D J , one B component, B lt and one E 
component, E 8 . In the tenth column of the eighth row (H10) 
of the first. plate (PI) of the library, L2, (H10,P1,L2) 
there will be the ten C components, two D components (D 1 
and DJ, one B component. B 10 , and one E component, E,. 
Hence the matrix containing all ten columns and all eight 
rows are used to accommodate 1600 compounds in total. 



10 



15 



20 



25 



In the first column of the first row (Al) of the second 
plate (P2) of the library, L2. (hereinafter designated as 
location A1,P2,L2). there will be ten C components, two D 
components <D 3 and D J , one B component, B lf and one E 
component, E, (Fig. 12) . m the tenth column of the first 
row (A10) of the second plate (P2) of the library, L2, 
(hereinafter designated as location A10,P2,L2), there will 
be ten C components, two D components (D, and DJ , one B 
component, B 10 , and one E component, E x . In the first 
column of the second row (Bl) of the second plate (P2) of 
the library, L2, (hereinafter designated as location 
B1,P2,L2), there will be ten C components, two D 
components (D } and D J , one B component, B x , and one E 
component, E 2 . in the tenth column of the eighth row (H10) 
of the second plate (P2) of the library, L2, (H10,P2,L2), 
there will be ten C components, two D components (D 3 and 
DJ , one B component, B l0 , and one E component, E,. 

In the first column of the first row (Al) of the third 
plate (P3) of the library, L2, (hereinafter designated as 
location A1,P3,L2), there will be ten C components, two D 
components (D 5 and DJ , one B component, B„ and one E 
component. E, (Fig. 13). In the tenth column of the first 
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row (A10) of the third plate (P3) of the library, L2, 
(hereinafter designated as location A10,P3,L2), there will 
be ten C components, two D components (D s and DJ , one B 
component, B 10 , and one E component, E x . In the first 

5 column of the second row (Bl) of the third plate (P3) of 
the library, L2, (hereinafter designated as location 
B1,P3,L2), there will be ten C components, two D 
components (D s and DJ , one B component, B x , and one E 
component, E 2 . In the tenth column of the eighth row (H10) 

o of the third plate (P3) of the library, L2, (H10, P3,L2) , 
there will be ten C components, two D components (D s and 
D 6 ) , one B component, B 10 , and one E component, E e . 



In the first column of the first row (Al) of the fourth 
plate (P4) of the library, L2, (hereinafter designated as 

is location Al,P4 f L2), there will be ten C components, two D 
components (D 7 and D fl ) , one B component, B 1# and one E 
component, E, (Fig. 14) . In the tenth column of the first 
row (A10) of the fourth plate (P4) of the library, L2, 
(hereinafter designated as location A10,P4,L2), there will 

20 be ten C components, two D components (D 7 and D 8 ) , one B 
component, B 10 , and one E component, E x . In the first 
column of the second row (Bl) of the fourth plate (P4) of 
the library, L2, (hereinafter designated as location 
B1,P4,L2), there will be ten C components, two D 

25 components (D 7 and D J , one B component, B 1# and one E 

component, E 2 . In the tenth column of the eighth row (H10) 
of the fourth plate (P4) of the library, L2, (H10,P4,L2), 
there will be ten C components, two D components (D 7 and 
D J , one B component, B 10 , and one E component, E e . 
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The FRET strategy is based on the synthesis of two 
orthogonal sets of mixtures in solution. These solutions 
are each indexed in two dimensions. Thus the data from, 
for example, a protease scan identifies the most active 
compounds without the need for decoding or resynthesis. 
The positional preferences of sub-units (in this case 
amino acids) are optimised with respect to all other 
variant positions simultaneously. The synergistic 
relationship between all four positions is realised and 
both positive, beneficial and negative, deactivating data 
are generated. This leads to families (sub-populations) 
of substrates and their sub-unit preferences. The data 
can be fed into molecular modelling programs to generate 
pharmacophore descriptors that encompass both the 
is desirable features (from the positive data) and indicate 
undesirable interactions (from the negative data sets) . 
Note that a one dimensional scan only indicates one 
position at a time as 'most active 1 and does not explore 
the synergistic relationship between positions. 



10 



The general methodology exemplified above with regard to 
the use of complementary combinatorial FRET libraries for 
the identification of proteolytic enzyme substrates, is 
equally applicable for identification of compounds from a 
library which interact with another active moiety. 
Combinatorial libraries of compounds containing four 
variable groups B, c, D and E can be produced an 
interactions with active moieties detected using suitable 
reporters or markers. 
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For example, the active moiety may be a protein kinase. 
Protein kinases include (but are not limited to) the 
following: 

ZAP-70, Syk, P 56 p59'v\ Yes, Hck, Src, Btk, Blk, Lyn, 
5 Raf , EGF kinase, Insulin receptor kinase, cyclin dependent 
kinases, Bcl-Abl, lkB-kinase C- terminal kinases, lkB N- 
terminal kinases, Jak kinases, MAP kinase kinases, MAP 
kinase kinase kinases, MAP kinases (Erks 1-3, p38 and Jun 
kinases), STAT family C- terminal serine /threonine kinases, 
io protein kinase A, protein kinase B, all protein kinase C 
isof orms . 

The active interaction of a protein kinase with a compound 
in a library may be detected, for example, by 
radioactively labelled phosphorylation of the substrate in 
is the library. 

Thus it is possible to identify peptide substrates of 
protein tyrosine kinase by phosphorylation of exogenous 
peptide substrates present in a library. The assay uses 
the transfer of 33 P from radiolabeled ATP to lysine-tagged 

20 tyrosine-peptides substrate as a measure of kinase 
activity. The peptide binds to negatively charged 
membrane and is washed free of unincorporated [y-"P]-ATP. 
The detection of (S-energy emission using a fluor- 
containing scintillant on a p-counter instrument provides 

25 the measurement of peptide phosphorylation. 
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In another example the active moiety could be a 
glycosidase and the interaction would be with enzyme 
substrates in the library. In such a case a scissile bond 
between at least one of the groups B, C, D and E would be 
an acetal or glycosidic bond. The groups B, C, D and E 
could be selected from furanosides and pyranosides. 



In another example the active moiety could be a nuclease . 
In such a case the scissile bond would be a phosphodi ester 
bond and the groups B, C, D and E would be nucleotides. 



10 The general methodology of the present invention can also 
be applied to the detection of an interaction between an 
active moiety such as. a receptor and a library containing 
ligands for the receptor. Examples of receptor/ligand 
interactions are shown by way of non- limiting illustration 

15 below: 



Chemokine receptor/ chemokine i.e. CKR3/Eotaxin 

SH3 domain/proline amino acid sequence i.e. Grb2/S0S 
peptide 

WW domain/proline amino acid sequence i.e. Yap (Yes- 
associated protein) /RSV Gag protein 

SH2 domain/phosphopeptide i.e. Grb2/LNK phosphopeptide 
Lectin domain/1 igand i.e. CD72/CD5. 



Further more, combinatorial 
or antigenic epitopes could 
an active antibody moiety. 



libraries of possible antigens 
be tested for interaction with 
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The reporter or marker for detection of a reaction in 
receptor/ ligand and antibody/antigen libraries could be 
similar. Suitable reporters are, for example, (but not 
limited to) : 

5 1. Biotin tag on the ligand using streptavidin-HRP for 
detection, 

2. Alkaline phosphatase tag, 

3. Radioisotope tag. 

The invention will now be described by reference to the 
10 following examples. 

Example 1: Cysteine Protease Inhibitors and Substrates 

In this Example the proteolytic enzyme of interest is Der 
PI, which is found in house dust mite faeces. The example 
illustrates the synthesis of a number of FRET compounds in 
15, which the suitable bond is an unsubstituted amide bond, 
their use as a library for screening for potential 
substrates of Der PI, and subsequent identification and 
synthesis of active inhibitors of the enzyme. 

Purification of Der pi. 

20 Crude mite extract (~100mg, SmithKline-Beecham, U.K) was 
dissolved in 5mL Phosphate Buffered Saline (PBS; 50 mM 
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potassium phosphate; pH 7.4 containing 150 mM NaCl Der pi 
was purified by affinity column chromatography using 4C1 
antibody (indoor Biotechnology, Deeside, U.K.) The crude 
preparation was mixed with ~2 mL of affinity resin for 2 h 
at 4°C and then washed with 2-3 volumes of PBS. Elution of 
bound protein was carried out using 5mM glycine containing 
50%{v/v) ethylene glycol. Fractions (2 .2 mL) were collected 
and neutralised with 0.8 mL of 0.2 M sodium phosphate 
buffer, pH 7.0. The fractions were pooled and dialysed 
overnight against 4 L PBS followed by a second dialysis 
against 2 L PBS for 2-3h. The total protein was 
concentrated as required by ultrafiltration (MacroSep; 
Flowgen, U.K. ) 



Synthesis of compounds 



The compounds were synthesised using the Multipin 
approach 25 -" using Fmoc-Rink amide Macro crowns (Chiron 
Mimotypes Pty., Ltd., 11 Duerdin Street, Clayton Victoria 
3168, Australia) with a loading of 7 pMoles. 

The amino acid residues of each of the compounds were 
linked using amide bonds in a suitable form. The coupling 
chemistry employed is similar to that reported in the 
literature 27 for f luorenylmethoxycarbonyl protected amino 
acids and activated pentaf luorophenyl esters, in which the 
side-chains are protected using acid labile protecting 
groups known to those skilled in the art, such as Boc- 
(for the -NH 2 of Lysine, -NH 2 of anthranilic acid and 
guanidino of arginine) , tBu- (for the -OH groups of 
serine, threonine and tyrosine), t-Bu for the -COOH group 
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of Aspartic acid and Glutamic acid, Trityl- {for the Amide 
of Asparagine and Glutamine, and the amine functionality 
of the Histidine ring. 

The N-a-f luorenylmethoxycarbonyl protecting group of the 
5 coupled residues were cleaved using 20% piperidine in 
dimethyl formamide (DMF) for 30 minutes at 20° C. The 
coupling reactions for the free acids such as Boc-ABz-OH 
(Boc-2-aminobenzoic acid), and Fmoc- (3-nitro) tyrosine-OH 
were accomplished using 10 equivalents of a mixture of the 
10 free acid (1 eq.) :TBTU (0.98 eq.): HOBt (0.98 eq.) : N- 
methylmorpholine (1.96 eq.) in dimethyl formamide (500 /iL) 
as solvent for 5 hours at 20° C. The other amino acids were 
coupled as their pentaf luorophenyl esters 26 for 2-6 hours. 

Hence, in order to couple approximately equal ratios of 
15 each component in the mixture of the derivatised amino 

acids as their pentaf luorophenyl esters, a solution of a 
total of 0.98 equivalents (relative to the amino group 
loading on the crown) of the mixture of amino acid 
pentaf luorophenyl esters : HOBT (1 eq.) in DMF (500 /zL) 
20 were coupled for 16 hours at 20° C. The pins were then 
washed well with DMF and then recoupled using the same 
mixture under the same conditions. A third coupling of 10 
equivalents (relative to the amino group loading of the 
crown) for 2 hours in DMF was performed using this 
25 coupling protocol with equimolar mixtures of the 

derivatised pentaf luorophenyl esters of the amino acids in 
slightly less than 1 equivalent, it is possible to obtain 
approximately equal amounts of the coupled products to the 
crown. In this fashion the libraries are constructed with 
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20 compounds present on each crown. The compounds were 
cleaved from the crowns directly into the 80 designated 
wells of the. desired 96 well plate. In the cleavage 
protocol each crown was treated with a mixture (600 /xt) 
containing trif luoroacetic acid (95%), triethylsilane (5%) 
for 2 hours at 20° C. The crowns were then washed with 
trifluoroacetic acid (500 jiL) and this was then combined 
with the cleavage solution. 



10 



15 



The Fmoc-Rink amide Macro crowns (Chiron Mimotypes Pty., 
Ltd., 11 Duerdin Street, Clayton Victoria 3168, Australia) 
at 7 /iMol loading per crown, were coupled with a 10 fold 
excess of a mixture containing L-Fmoc-Asp (O-t-Bu) -OH (leq) 
using TBTU (0.98 eq) and N-methylmorpholine {1.96 eq.) in 
the presence of HOBt (0.98 eq.) in DMF at 0.14M 
concentration. After deblocking of the Fmoc group with 20% 
piper idine in DMF for 30 minutes and subsequent washing 
with DMF and then methanol, coupling of the Fmoc- (3- 
nitro) tyrosine-OH was accomplished using 10 equivalents of 
a mixture of the Fmoc- (3 -nitro) tyrosine-OH (1 eq.) :TBTU 
(0.98 eq.): HOBt (0.98 eq. ) : N-methylmorpholine (1.96 
eq.) in dimethyl formamide as solvent at 0.14 M 
concentration for 5 hours at 20° C. Removal of the Fmoc 
group (vide infra) was followed by coupling of the 
mixtures of amino acids in the ratios outlined and under 
25 the conditions described (vide infra). 

In a particular example the amino acids comprising group B 
include Ala, Val, Leu, Ser, Asn, Gin, Glu, Lys, Phe, Pro. 
The amino acids comprising group C include Ala, Val, Leu, 
Ser, Asn, Gin, Glu, Lys. Phe. Pro. The amino acids 



20 
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comprising group D include Ala, Val, He, Leu, Nle, Ser, 
Glu, Phe. For n=4, the amino acids comprising group E 
include Ala, .Val, He, Leu, Nle, Ser, Glu, Phe. Otherwise 
any selection from the amino acids can be made for n=l, 2, 
5 or 3. 

The plates containing the combined cleavage solutions and 
were then evaporated to dryness to yield the component 
mixtures using a rotary centrifuge ( "SPEEDVAC , Savant 
Instruments Inc., Farmingdale, NY) at 800 rpm for 1 hour 

10 at 20° C under a reduced pressure of 10 3 mmHg. Each 

component was then transferred to the final mother plate 
using a (50%: 45%: 5%) mixture of acetonitrile : water: 
acetic acid. The plates were then lyophilised to dryness 
using at 20° C under a reduced pressure of 10* 2 mrnHg, and 

is then stored at -20° C. In this fashion libraries of the 
type shown in Figures 2-14 were prepared. 

In further detail, the Multipin approach which was 
employed is described below: 

Multipin Synthesis Of Potential Substrates of Der pi 

20 The * Chiron' multipin kit consists of a standard 8 x 12 
pin holder containing 96 % pin stems' to which are 
reversibly attached * crowns' . The * crowns' provide a 
reactive polymer surface upon which a growing peptide is 
anchored during solid phase peptide synthesis. Each crown 

25 (the equivalent of the peptide-resin in standard solid 
phase synthesis) can be considered to be an independent 
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reactor by performing simultaneous synthesis in individual 
lmL wells of industry standard 96 well plates. Each well, 
and thus each crown, can be charged with a unique set of 
reagents providing unique sequences to each crown. Common 
steps such as washing or removal of Na protection can be 
performed concomitantly. 

Synthesis is based upon the use of Na- 

fluorenylmethyloxycarbonyl (Fmoc) protected amino acids. 
Side-chains of tri- functional amino acids are protected 
with acid labile groups such as trityl or tert-butyl. The 
addition of amino-acid residues to the growing peptide 
chain, a process termed • coupling' proceeds through the 
utilisation of pre-formed pentaf luorophenyl (pfp) esters 
or activation of the free acid, using the reagents HBTU or 
BOP in the presence of tertiary base (NMM) and HOBt as 
catalyst. 

The experimental techniques used are fully documented 
(Maeji, N. J. Bray, A. M. Valerio, R. M . and Wang, W., 
Peptide Research, 8(1), 33-38, 1995 and Valerio, R. M. 
Bray, A. M. and Maeji, N. J. Jnt. J. Pept. Prot. Res, 44, 
158-165, 1994) and the main steps are briefly as follows. 

General Methods 

Preparation of Multipin Assembly 

Whilst wearing standard plastic gloves, Fmoc-Rink Amide 
derivitized macrocrowns are assembled (simply clipped) 
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onto stems and slotted into the 8 x 12 stem holder in the 
desired pattern for synthesis. 

Removal of Na-Fmoc Protection 

A 250mL solvent resistant bath is charged with 200 ml of a 
5 20% piperidine/DMF solution. The multipin assembly is 

added and deprotection allowed to proceed for 30 minutes. 
The assembly is then removed and excess solvent removed by 
brief shaking. The assembly is then washed consecutively 
with (200mL each) , DMF (Smins) and MeOH (5mins, 2mins, 
10 2mins) and left to air dry for 15mins. 

Quantitative UV Measurement of Fmoc Chromophore Release 

A 1cm path length UV cell is charged with 1.2mL of a 20% 
piperidine/DMF solution and used to zero the absorbance of 
the UV spectrometer at a wavelength of 290nm. A UV 
standard is then prepared consisting of 5.0mg Fmoc- 
Asp(OBut) -Pepsyn KA (0.08mmol/g) in 3.2mL of a 20% 
piperidine/DMF solution. This standard gives Abs 290 = D.55- 
0.65 (at RT) . An aliquot of the multipin deprotection 
solution is then diluted as appropriate to give a 
theoretical Abs 290 = 0.6, and this value compared with the 
actual experimentally measured absorbance showing the 
efficiency of previous coupling reaction. 

Coupling of amino- acid residues 
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Whilst the multipin assembly is drying, the appropriate 
Na-Fmoc amino acid pfp esters (10 equivalents calculated 
from the loading of each crown) and HOBt (10 equivalents) 
required for the particular round of coupling are 
accurately weighed into suitable containers. 
Alternatively, the appropriate Na-Fmoc amino acids (10 
equivalents calculated from the loading of each crown) , 
desired coupling agent e.g. HBTU (9.9 equivalents 
calculated from the loading of each crown) and activation 
eg HOBt (9.9 equivalents calculated from the loading of 
each crown), NMM (19.9 equivalents calculated from the 
loading of each crown) are accurately weighed into 
suitable containers. 



The protected and activated Fmoc amino acid derivatives 
are then dissolved in DMF (500/xl for each macrocrown, e.g. 
for 20 macrocrowns, 20 x lOeq x 7mmoles of derivative 
would be dissolved in 10 000/iL DMF) . The appropriate 
derivatives are then dispensed to the appropriate wells 
ready for commencement of the * coupling cycle' . As a 
standard, coupling reactions are allowed to proceed for 2- 
6 hours (depending upon nature of coupling e.g. Ala to Ala 
2 hours Val to Leu 6 hours. 



When coupling Fmoc amino-acid pentaf luorophenyl esters, 
lOeq of derivative in DMF (400/zl) with bromophenol blue 
stock solution (100^1) is used for each macrocrown. This 
allows monitoring of the progress of the acylation 
reaction through the disappearance of the deep blue 
coloration of bromophenol blue in the presence of 
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unreacted amine to a pale yellow upon completion of 
acylation. 

Preparation of Bromophenol Blue Stock Solution 

Bromophenol blue (20mg) is dissolved in DMF (50mL) and 
5 HOBt (lOmg) added. 

Washing Following Coupling 

If a 20% piperidine/DMF deprotection is to immediately 
follow the coupling cycle, then the multipin assembly is 
briefly shaken to remove excess solvent washed 

10 consecutively with (200mL each) , MeOH (Smins) and DMF 
{Smins) and deprotected (see above) . If the multipin 
assembly is to be stored, then a full washing cycle 
consisting brief shaking then consecutive washes with 
(200mL each), DMF (Smins) and MeOH (5mins, 2mins, 2mins) 

is is performed. 

Acidolytic Mediated Cleavage of Peptide-Pin Assembly 

Acid mediated cleavage protocols are strictly performed in 
a fume hood. A polystyrene 96 well plate (lmL / well) is 
labelled, then the tare weight measured to the nearest mg. 
20 Appropriate wells are then charged with a trif luoroacetic 
acid / triethylsilane (95:5, v/v, 600/zl) cleavage 
solution, in a pattern corresponding to that of the 
multipin assembly to be cleaved. 
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The multipin assembly is added, the entire construct 
covered in tin foil and left for 2hrs. The multipin 
assembly in then added to another polystyrene 96 well 
plate (lmL /well) containing trif luoroacetic acid / 
triethylsilane (95:5, v/v, 600 M 1) (as above) for 5 mins. 

The cleaved assembly is washed with DMF (200^L, 5mins) , 
MeOH (200//L, 5mins) , the spent crowns removed and 
discarded, the stems removed and washed by sonication in 
methanol (lhr, RT) . 



Work up of Cleaved Peptides 

The primary polystyrene cleavage plate (2hr cleavage) and 
the secondary polystyrene plate (5min wash) (see above) 
are then placed in the SpeedVac and the solvents removed 
(minimum drying rate) for 90mins. 

The contents of the secondary polystyrene plate (see 
above) are transferred to their corresponding wells on the 
primary plate using an acetonitrile / water / acetic acid 
(50:45:5, v/v/v) solution (3 x 150^1) and the spent 
secondary plate discarded. 



20 Analysis of Products 

1.0/xl of each well (see above) is diluted to 400/xl with 
0.1 %aq TFA and analysed by HPLC-MS. Column Vydac C4 
(214TP52, narrowbore, 21 x 250mm). Eluents :- Solvent A = 



15 
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0.1%aq trifluoroacetic acid, Solvent B = acetonitrile / 
10%A. Gradient :- 10-90% B in A over 27mins, 250ml / min, 
215nm UV detection. The individual substrates described 
below were prepared by the above methods and shown by 
HPLC-MS to be >95% with the correct mass. 

Final Lyophilisation of Peptides 

The primary polystyrene plate (plus the washings from the 
secondary plate) is covered with tin foil, held to the 
plate with an elastic band. A pin prick is placed in the 
foil directly above each well and the plate placed at 
-80°C for 30mins. The plate is then lyophilised on the 
'Heto freeze drier' overnight. Where appropriate 
individual peptides were then weighed and dissolved to 
lOmM stock solutions in DMSO prior to biological 
screening. Alternatively the 20 component mixture is 
weighed and the peptide/20 component ratio is calculated. 

Further coupling of amino acid residues was carried out 
according to the multipin approach described above. Whilst 
the multipin assembly was drying, the appropriate Ncr-Fmoc 
amino acids (10 equivalents calculated from the loading of 
each crown), HATU coupling agent (9.9 equivalents 
calculated from the loading of each crown) , HOAt catalyst 
(9.9 equivalents calculated from the loading of each 
crown) and DIPEA (19.9 equivalents calculated from the 
loading of each crown) were accurately weighed into 
suitable containers. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/42216 



PCT/GB97/01158 



- 55 - 

The protected Na-Fmoc amino acids and coupling agents wer 
then dissolved in DMF (500 fil for each macrocrown) and 
activated by. the addition of DIPEA. The appropriate 
derivatives were then dipensed to their appropriate wells 
and as standard coupling to each macro crown was allowed 
to proceed for 2 hours. 



When coupling particularly hindered amino acid residues 
such as N-Methyl, Cor-Methyl or unusual amino acids (whose 
coupling efficiency is unknown) the coupling reaction was 
repeated, as standard, for a further 2 hours. 



Substrates for Der pi 



Using the general techniques described above, the 
following compounds were prepared and assayed as potential 
substrates against Der pi purified as described above. 

is Peptide [SEQ ID Nos. 1-76] Measured k. ( M M) 



Abz-Val-Ala-Nle-Ser-Tyr(N0 2 )-Asp-NH 2 
H-Val - Ala -Nle-Ser-TyrN0 2 -Asp-NH 2 
H-Ala-Nle-Ser-Tyr(N0 2 ) -Asp-NH 2 
Ac - Val -Ala-Nle-Ser- Tyr ( N0 2 ) -Asp - NH 2 
H-Val -Ala-Nle-Ser-Tyr (N0 2 ) -NH 2 
H-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 
Ac-Val-Ala-Nle-Ser-Tyr(N0 2 ) -NH 2 
Abz-Val-Ala-Nle-Ser-Tyr (N0 2 ) -NH 2 
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Abz-Val-Ala-Nle-Ser-NH 2 


NM 




Abz-Val-Ala-Nle-Ser-Phe-Asp-NH 2 


NM 




Abz - Val - Ala -Nle -Ser-Tyr -Asp -NH 2 


NM 




Abz-Val-Ala-Nle-Ser-Ala-Asp-NH 2 


NM 


5 


Abz-Val-Ala-Nle-Ser-Lys-Asp-NH 2 


NM 




Abz-Val-Ala-Nle-Ser-eAHA-Asp-NH 2 


NM 




Abz -Ala-Nle- Ser-Tyr (N0 2 ) -Asp-NH 2 


NM 




Abz-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 




Bz -Val -Ala-Nle- Ser-Tyr (N0 2 ) -NH 2 


NM* 


10 


Bz (2-carboxy) -Val-Ala-Nle-Ser-Tyr (N0 2 ) -NH 2 


NM a 




Chex-Val-Ala-Nle-Ser~Tyr(N0 2 ) -NH 2 


NM* 




n-Bu-Val-Ala-Nle-Ser-Tyr (N0 2 ) -NH 2 


NM* 




Piv-Val-Ala-Nle-Ser-Tyr (NO, J -NH 2 


NM* 




Bz-Val-Ala-Nle-Ser-Tyr (N0 2 ) -NH 2 


NM* 


15 


Abz -Val -Ala-Lys -Ser-Tyr (N0 2 ) -Asp-NH 2 


14 




Abz-Val-Ala-Gln-Ser-Tyr (N0 2 ) -Asp-NH 2 


6 




Abz-Val-Ala-Thr-Ser-Tyr (N0 2 ) -Asp-NH 2 


6 




Abz-Val-Ala-hLeu-Ser-Tyr<N0 2 ) -Asp-NH 2 


4 




Abz-Val-Ala-Cha-Ser-Tyr (N0 2 ) -Asp-NH 2 


5 


20 


Abz-Val-Ala-His-Ser-Tyr (N0 2 ) -Asp-NH 2 


> 20 




Abz-Val-Ala-ACH-Ser-Tyr(N0 2 ) -Asp-NH 2 


NS 




Abz-Val-A!a-DNle-Ser-Tyr(N0 2 ) -Asp-NH 2 


NS 




Abz-Val-Ala-3pyr-Ser-Tyr (N0 2 ) -Asp-NH 2 


10 




Abz-Val-Ala-Hyp-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 


25 


Abz - Val - Ala - ACP - Ser -Tyr ( N0 2 ) -Asp -NH 2 


NS 




Abz-Val-Lys-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


35 




Abz - Val -DAla-hLeu - Ser-Tyr ( N0 2 ) -Asp - NH 2 


NS 
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adz vaj.-Tic-nLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 




adz vax-ALH-nLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 




adz vai-Met (0) -hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


35 




ADZ-Val-2Nal-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 




Aoz-Val-ACP-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 




adz - Va 1 -fiLy s - hLeu - Ser - Ty r ( N0 2 ) -Asp - NH 2 


NS 




ADZ-Val-BGln-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 




Abz-Val-3pyr-hLeu-Ser-Tyr(N0 2 ) -Asp-NH 2 


NS 




Abz-Val-Cha-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 


10 


Abz-DVal-Ala-hLeu-Ser-Tyr(N0 2 ) -Asp-NH 2 


NS 




Abz-Gln-Ala-hLeu-Ser-Tyr(N0 2 ) -Asp-NH 2 


12 




Abz-Lys -Ala-hLeu-Ser-Tyr (NO,) -Asp-NH 2 


>15 




Abz-Tic-Ala-hLeu-Ser-Tyr(N0 2 ) -Asp-NH 2 


NS 




Abz-ACH-Ala-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


NS 


lb 


Abz-Met(O) -Ala-hLeu-Ser-Tyr (N0 2 )-Asp-NH 2 


20 




Abz-3pyr-Ala-hLeu-Ser-Tyr(N0 2 ) -Asp-NH 2 


>10 




Abz-2Nal -Ala-hLeu-Ser-Tyr <N0 2 ) -Asp-NH 2 


15 




Abz-Leu-Ala-hLeu-Ser-Tyr (N0 2 ) -Asp-NH 2 


18 




Abz-Cha-Ala-hLeu-Ser-Tyr(N0 2 ) -Asp-NH 2 


9 




Abz-Bip-Ala-hLeu-Ser-Tyr(N0 3 ) -Asp-NH 2 


2.5 




Abz-Bip-Ala-hXeu-Tyr-Tyr (N0 2 ) -Asp-NH 2 


3 




ADZ-Bip-Ala-hLeu-Leu-Tyr (N0 2 ) -Asp-NH 2 


3.7 




Abz -Bip-Ala-hLeu- Lys-Tyr ( N0 2 ) -Asp-NH 2 


2 




Abz-Bip-Ala-hLeu-Asp-Tyr (N0 2 ) -Asp-NH 2 


5.0 


25 


Abz-Bip-Ala-hLeu-Abu-Tyr (N0 2 ) -Asp-NH 2 


1.7 




Abz-Bip-Ala-hLeu-Cha-Tyr(N0 2 )-Asp-NH 2 


2.5 




Abz-Bip-Ala-hLeu-Met (0) -Tyr (N0 2 ) -Asp-NH 2 


5 
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Abz-Bip-Ala-hLeu-Thr-Tyr(N0 2 ) -Asp-NH 2 2.5 

Abz-Bip-Ala-hLeu-3pyr-Tyr (N0 2 ) -Asp-NH 2 4 

Abz-Bip-Ala-hLeu-Bu'Gly-Tyr (N0 2 ) -Asp-NH 2 4 

Abz-Bip-Ala-hLeu-Hyp-Tyr (N0 2 ) -Asp-NH 2 4 

5 Abz-Phe-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NM 

Abz - 3 . Pyr - Val - Al a -Nl e - Se r - Tyr ( N0 2 > -Asp-NH 2 NM 

Abz-l.Naph-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 17 

Abz-2 .Naph-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NM 

Abz-Tyr-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NM 

10 Abz-Bip-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 10 

Abz-Lys-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 15 

Abz-Glu-Val-Ala-Nle-Ser-Tyr (N0 3 ) -Asp-NH 2 20 

Abz-Leu-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NM 

Abz-Hyp-Val-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NS 



15 NS indicates that the peptide was not hydrolysed by Der 
FI- 
NN indicates that the peptide was a substrate for Der pi, 
but the K„ was not measured. 

NM a indicates that the peptide was a substrate for Der pi, 
20 and its cleavage was followed by HPLC-MS showing 
hydrolysis to occur between-Nle-Ser- . 

Ranked in order of cleavage rate : -Bz > n-But > Piv > 
Bz (2-carboxy) > Abz. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/42216 



PCT/GB97/01158 



- 59 - 

Coupling of amino acid residues was carried out according 
to the multipin approach described above. Whilst the 
multipin assembly was drying, the appropriate Na-Fmoc 
amino acids (10 equivalents calculated from the loading of 
each crown), HATU coupling agent (9.9 equivalents 
calculated from the loading of each crown) , HOAt catalyst 
(9.9 equivalents calculated from the loading of each 
crown) and DIPEA (19.9 equivalents calculated from the 
loading of each crown) were accurately weighed into 
suitable containers. 



15 



The protected Na-Fmoc amino acids and coupling agents were 
then dissolved in DMF (500 M l for each macrocrown) and 
activated by the addition of DIPEA. The appropriate 
derivatives were then dispensed to their appropriate wells 
and standard coupling to each macrocrown was allowed to 
proceed for 2 hours. 



20 



When coupling particularly hindered amino acid residues 
such as N-Methyl, Ca-Methyl or unusual amino acids (whose 
coupling efficiency is unknown) the coupling reaction was 
repeated, as standard, for a further 2 hours. 

The following sequences were synthesised in this way: 

Peptide Sequence [SEQ ID Nos. 77-85] Measured Km(/uM) 

Abz-Val-Ala- (NMe)Nle-Ser-Tyr(N0 3 ) -Asp-NH a N S 
Abz-Val- (NMe)Ala-Nle-Ser-Tyr(NO J ) -Asp-NH, NS 
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Abz-Val-Ala-Aib-Ser-Tyr (N0 2 ) -Asp-NH, NS 

Abz-Val-Aib-Nle-Ser-Tyr (NO.) -Asp-NH 2 NS 

Abz-Deg-Ala-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NS 

nBu-D.Ser-D.Nle-D.Ala-D.Val-p.Aba-NH 2 NS 

5 Bz-Val-Ala-Statine-Ser-eAha-NH 2 NS 

Abz-p.Aba-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NS 

Abz-Cmpi-Nle-Ser-Tyr (N0 2 ) -Asp-NH 2 NS 



NS-Not Substrate: Not hydrolysed by Der pi 

NM -Substrate but not measured: Substrate for Der pi but 
10 not measured. 

Assay procedure 

Each mixture of 20 compounds in the libraries of the 
apparatus described herein was screened at a concentration 
of 1.0 per compound in an assay against the cysteinyl 
is protease Der pi. The most active wells were identified by 
the rate of emission of fluorescence at 42 0 nm when the 
samples were irradiated at 320 nm. An analysis of the two 
complementary libraries showed that the best substrates 
for the enzyme were: 

20 Abz-B-C-D-E-Tyr (N02 ) -Asp-NH2 

Where 

B= Valine>Alanine, Glutamine, Leucine, Phenylalanine 

C= Alanine>>Glutamine, or Lysine. 

D= Leucine, Norleucine or Alanine >Serine 
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E= Serine 



The best substrate was: 

[SEQ ID:1] Abz-Val-Ala-Nle-Ser-Tyr(NOj) -Asp-NH, 



This compound was then resynthesised as a single component 
using the peptide synthesis methodology described herein. 
The K*JK, value for the pure substrate in the Der pi assay 
was measured as 3 .5xl0«M l s », and was considered to be 
suitably high for use in a high throughput assay for the 
general screening of inhibitors of Der pi . 



High throughput assay development 

Plate assays were carried out in 96 well plate format, 
using 0.1 ug of Der pi per 100 nh assay volume in each 
well and using 20 fM of the substrate. All assays were 
performed in Assay Buffer (AB; 50 mM potassium phosphate, 
pH 8.25 containing ImM ethylenediaminetetraacetic acid 
(EDTA) and ImM dithiothreitol (DTT) . The Der pi enzyme is 
pre-activated by addition of DTT and this in incubated at 
room temperature for 5 min. prior to initiation of the 
assay. As an example for the screening methodology, each 
well contains a 5 (tL of a 20 fM solution of the test 
compound in DMSO, 10 of a 200 tM aqueous solution of 
the substrate, 2, and 85 fxh of Der pi in AB is added to 
initiate the reaction. Enzyme activity is monitored by 
fluorescence using 320 nm for excitation and 420nm for the 
emission wavelengths using a Labsystems Fluroskan Ascent 
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machine. Kinetic measurements were carried out using a 
Hitachi F-4500 Fluorescence Spectrophotometer. 

Synthesis of inhibitors of Der pi 

The best substrate described above was shown by HPLC-mass 
spectroscopic analysis of the enzyme/ assay solution, to 
be cleaved between the Norleucine-Serine amide bond. 
Replacement of the terminal Abz group by a series of 
derivatives (e.g. Boc-, Pivaloyl, Benzoyl, and 2- carboxy- 
Benzoyl) affected substrate activity and specificity for 
the Der pi enzyme. With this knowledge of the P x -P 1 t 
cleavage site and for the P 4 -P 3 -P 2 -Pi motif, the compound 
Boc-Val-Ala-Leu-H, 4, was synthesised as shown in Scheme 
la, figure 15. 

Attachment of a suitable Michael acceptor such as CH=CH- 
C02Et, and -CH=CH-S0 2 Ph to the motif (Scheme 2, figure 
16) , provided active inhibitors of the enzyme with 
apparent IC 50 values of 50nM, 1000 nM and lOOnM 
respectively. 

Based on the sequence 4, a series of acyloxymethylketone 
compounds having active Der PI inhibitor activity was 
prepared. Details of the preparation and activity of this 
novel group of cysteine protease inhibitor compounds are 
discussed in co-pending International Application 
Publication Number WO97/04004 . 

Example 2: Viral Protease Inhibitors and Substrates 
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Design of Depsipeptides 

Another suitable bond in a compound of general formula 
(I), (II) or (III) according to the invention is an ester 
bond to form a depsipeptide . The incorporation of 
5 depsipeptide substrates aided the identification of 

substrates for low reactivity viral proteases, such as 
viral serine proteases. 

For example, substrates of the general formula 
n [Abz-B^o-C^-D^ytCOO] -E^-Tyr (N0 2 ) -Asp-NH 2 ] 
10 were produced. 

However, a significant proportion of viral proteases only 
recognise substrate sequences larger than those 
represented by the general structure above. It is well 
acknowledged that by the very nature of action of a viral 

15 protease (function is to cleave immature viral proteins 

into the mature viral package) one automatically receives 
data concerning the natural substrate sites. Thus, the 
general structure above can be extended by introducing 
extra fixed amino-acids at appropriate sites. A logical 

20 extension would be to introduce the known Pl-Pl' cleavage 
site as the depsipeptide bond, then subsequently introduce 
the four variant positions following the standard format 
thus ; 
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n rAbz-B^o-C^o-D^-E^-Pl y[COO] -PI ' -Tyr (N0 2 ) -Asp-NH 2 ] 

Furthermore, if these substrates again proved to be too 
small, one may use the known substrate sequences to 
introduce additional fixed positions. For instance, with 
5 Hepatitis NS3 protease it is known that the natural P6 
position is a conserved acidic residue (aspartic or 
glutamic acid) . Thus one could extend the above structure 
as detailed below. 



n [Abz-PS-B^o-C^o-D^-E^-Pl y [COO] -PI ' -Tyr <N0 2 ) -Asp-NH 2 ] 



10 The novel methodology descriB&jk herein greatly faciltiates 
the invention^of therapeutically useful proteolytic enzyme 
inhibitors and is commercially exploitable. This is 



because the best substrate moti^/fbr the proteolytic 
enzyme can be rapidly identified, and, since there exist 
15 in the literature a variety of ways for attaching motifs 
which react with the^active site of a proteolytic enzyme, 



especially for aspartyl, metallo, serine and cysteinyl 

'/ 

proteases, an enzyme inhibitor can be readily synthesised. 
Moreover, amide bond replacements or transition state 
20 mimetics can b<e incorporated into the molecule, which 

would be especialiycruseful for the^inhibition of aspartyl 

" ' ^ " 

or metallo proteases. 



The method described also facilitates the rapid 
development of a screening assay for novel protease 
25 inhibitors. The most potent fluorogenic substrate 

discovered by library screening can subsequently be used 
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for the detection of inhibitors of the particular 
proteolytic enzyme under scrutiny. 

The presence of an inhibitor within the compound libraries 
described is readily detected by retreatment of the assay 
mixture with the most active fluorogenic substrate, which 
will allow the immediate measurement of the remaining 
proteolytic enzyme activity. 

The invention provides self -decoding, combinatorial 
fluorogenic libraries, and it will greatly facilitate the 
design and invention of novel protease inhibitors because: 

The peptides of the library may have 
increased aqueous solubility in comparison 
to peptides containing similar and other 
fluorogenic and quencher groups. 

The peptides are stable to contaminating 
exopeptidases. 

The self deconvolution method described, 
coi^^d^wi%h^tl^ r 'c^ontihuous analysis of the 
rate of substrate cleavage data, allows the 
immediate identification of the most active 
binding motif contained within the 
substrate library. 

The method allows for the rapid assessment 
of the enzyme assay mixture for any 
compounds in the library that are acting as 
enzyme inhibi tors . 



is ii . 



iii . 



20 



iv. 
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Example 3: Kinase Inhibitors and Substrates 

As previously indicated, the present invention is a 
broadly applicable methodology and is not limited to the 
use of FRET libraries to detect protease 
inhibitor/substrate compounds. 

An example of another use of the invention is the 
detection of the interaction of a kinase with a 
substrate/inhibitor by monitoring radioactively labelled 
phosphorylation of compounds of complementary 
combinatorial libraries. 

The Identification Q g a ZAP-7Q TyrQgjge. Kinase Inhibitor 

ZAP-70 is a intracellular kinase essential for T-cell 
signal transduction from the T-cell antigen receptor. 
Mutations resulting in non- functional ZAP-70 kinase have 
been observed in patients suffering from a severe combined 
immunodeficiency disease. Drug down -modulation of ZAP-70 
kinase activity has therefore been seen by the major 
pharmaceutical companies as a potential method of down- 
regulating the immune system in an antigen -independent 
manner. Such a drug could be used in both transplant 
treatment and as a therapy in autoimmune diseases. 

Generation of ZAP-70 kinase catalytic domain 

The full length protein ZAP-70 tyrosine kinase possesses 
amino -terminal tandem SH2 domains which allow enzyme 
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recruitment to phosphorylated tyrosine residues on the T 
cell antigen receptor, but are unnecessary for protein 
kinase catalytic activity. Removal of the SH2 domains 
results in a constitutively active, stable ZAP-70 kinase. 
Specific oligonucleotide polymerase chain reaction (PCR) 
primers were used to amplify amino acids 308 to 619 and 
add a carboxy- terminal hexahistidine sequence using human 
Jurkat cDNA as a template. This amplicon was cloned into 
the baculoviral transfer vector pACUWSl (Pharmingen) and 
recombinant baculovirus produced by homologous 
recombination in Sf9 insect cells. 

CatZAP(H6) Purification 

Low titre stocks of mixed CatZAP-70 baculovirus were used 
to infect sub-confluent monolayers of Sf9 cells. Cells 
were harvested 3 days post-infection, washed in PBS and 
osmotic lysis performed immediately. Isolation of 
CatZAP(HJ protein was carried out using cobalt column 
chelation chromatography, fractions eluted using 100 mM 
imidazole and kinase activity determined by 33 P transfer. 

*Uqh throughput protein kinase scree n ing assay 

The interaction of the active protein kinase moiety with 
compounds of a combinatorial peptide library can be 
determined. 

A rapid and very sensitive assay was developed for 
measuring the amount of peptide substrate phosphorylation 
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produced by CatZAP{H 6 ) . The peptide library was 
synthesised with a positively charged tetra-lysine tag. 
After peptide incubation with the protein kinase and ATP, 
containing trace amounts of radiolabeled P 3 P-y1 -ATP, the 
reaction mixture is passed through a negatively charged 
phosphocellulose membrane using a 96 well plate vacuum 
manifold system (Millipore) . Positively charged peptide 
is bound by the membrane, whereas unincorporated [33P-y] - 
ATP is washed through. Scintillant is then applied to the 
membrane and the amount of phosphorylated peptide 
calculated from the level of radioactivity on the 
membrane, measured using a well plate scintillation 
counter. This type of assay can be readily applied to 
other protein kinase systems. 

The result of this screening assay is the identification 
of one or more protein kinase substrate motifs, which 
appear to have the common features of a negatively charged 
N-terminus and a conserved Pro at the C-terminus. 

For other protein tyrosine kinases, the existing 
libraries may provide some information, although it is 
envisaged that additional libraries will require to be 
built. For Ser/Thr kinases, the technology may be adapted 
by the use of a central Ser or Thr residue as the 
phosphoacceptor moiety. 

In addition, technology is being developed within the 
chemistry department which will allow for the generation 
of cyclic peptide libraries. Incorporation of a central 
phosphoacceptor residue in these libraries will provide a 
source of constrained molecules for use in kinase assays. 
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Abbreviations 

Abbreviations used herein are as follows: 

Abbreviations for amino acids and nomenclature of peptide 
structures follow the recommendations given in: IUPAC-IUB 
Commission on Biochemical Nomenclature, fj. Biol. Chem, 
247, 997, 1971). All chiral amino acids are of the L 
configuration unless otherwise stated. Other abbreviations 
used are : 

-Abu , b-amino butyric acid, : Abz, 2-amino benzoyl : ACH, 
1-amino-l-carboxy-cyclohexane : ACP, 1 -amino- 1-carboxy- 
cyclopropane : Bip, Biphenyl alanine : n-Bu, n- 
butoxycarbonyl : Bz, Benzoyl : Bz (2-carboxy) , 2- 
carboxybenzoyl : Bu c Gly, tert-Butylglycyl : BOP, 
benzotriazoyl-oxy-tris- (dime thy lamino) -phosphonium 
hexafluorophosphate: Cha, eye lohexyl alanine : Chex, 1- 
carboxycyclohexyl : eAHA, gamma aminohexanoyl : HBTU, 0- 
benzotriazoyl-N,N,N' ,N' -tetramethyluronium 
hexafluorophosphate : HOBt , 1-hydroxybenzotriazole : Hyp, 
trans-4-hydroxyprolinyl : hLeu, homoleucyl : 2Nal, 2- 
napthylalanine : NMM, N-methylmorpholine : Piv, pivoyl : 
3pyr, 3 -pyr idyl alanine : Tic, 2-carboxytetrahydroquinolyl 
: Tyr(N0 2 ), 3 -nitrotyrosine . 

DMF, dimethyl formamide; Fmoc, f luorenylmethoxycarbonyl ; 
HPLC, high performance liquid chromatography; pfp, 
pentafluorophenyl, tBoc, tert-butoxycarbonyl ; tBu, tert- 
butyl; TFA, trif luoroacetic acid; Pmc, pentamethyl 



15 
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chroman, Pbf, pentamethylbenzofuran, TBTU, 2-(lH- 
Benztrot riazole- 1 -yl )-l,l,l,3,3- Cetrame thyluroniurn 
tetraf luorobprate; Trt, Trityl . 

p. Aba, 4-aminobenzoyl; Aib, Aminoisobutyric acid; Bip, 
5 Biphenylalanine; nBu, n-Butyl; Bz, Benzoyl; Cmpi, 

Carboxymethylpiperazine; Deg, Diethylglycine; DIPEA, N,N- 
Diisopropyl-ethylamine; HATU, 0- (7-azabezotriazol-l-yl) - 
1, 1,3, 3-tetramethyluronium hexaf luorophosphate; HOAT, 1- 
hydroxy-7-azabenzotriazole; Naph, Naphthylalanine; 3.Pyr, 
10 3-pyridiylalanine; Tyr(N0 2 ), 3-nitro- tyrosine . 
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1. Libraries of compounds which interact with an active 
moiety, the libraries comprising two orthogonal sets of 
mixtures of compounds in solution providing two 
complementary combinatorial libraries (referred to as 
primary and secondary libraries) indexed in two dimensions 
for autodeconvolution, thereby allowing characterisation 
of an active motif in any group of compounds. 



2. A complementary pair of compound libraries LI and L2 
according to claim 1 which constitute a set containing 
compounds having four variable groups B, C, D, E of 
formula : 



-Bb-Cc-Dd-n(Ee) - 

giving bxcxdxe=Mn compounds in each library, there 
being a predetermined number (Pi, P2) of mixtures each 
consisting of a predetermined number (Ql, Q2) of 
individual identifiable compounds in each library, wherein 
both LI and L2 contain the same Mn compounds, but wherein 
any two compounds which are found together in one mixture 
of Ql compounds of LI are not found together in any one of 
the P2 mixtures of L2. 



3 . A method of screening for interaction of an active 
moiety applied to a well with one or more compounds of a 
mixture in the well, using the libraries LI, L2 according 
to claim 2, in which the PI mixtures of LI and the P2 
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mixtures of L2 are each placed separately into individual 
wells of well plates, the well plates having wells 
arranged in a format adapted to allow deduction of a 
unique active compound formula from the presence of 
5 activity in one well of LI and one well of L2. 

4 . A method according to claim 3 wherein the format 
complies with general deconvolution formulae in which: 

(i) ns = Rp.Cp. Np 



Rs . Cs 



10 



(ii) 



k = b . c . d . np . e 



(iii) 



k = x.N.np 



(iv) 



N = Rp.Cp 



(v) 



K = X.Rp.Cp.np 



(vi) 



b.c.d.e = 



X . Rp . Cp 



15 



(Vii) 



Cp.e = X 



(viii) 



Rp.e 



= X, if Rp=Cp 



and wherein 
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np = number of primary plates 

ns = number of secondary plates 

Rp = number of primary rows 

Rs = number of secondary rows 

5 Cp = number of primary columns 

Cs = number of secondary columns 

K = number of combinations of compounds 

N = number of wells on a plate, and 

X = number of compounds per well. 



A method according to claim 4 wherein 



np 4 

ns = 16 

Rp = 8 

Rs = 4 

15 Cp = 10 

Cs = 5 

K = 6400 

N = 80 

X = 20 



6. A method according to claim 3 wherein the active 
moiety is selected from the group consisting of enzymes, 
receptors and antibodies. 
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7. A method according to claim 6 wherein interaction of 
the active moiety with the compounds of the library is 
selected from the group consisting of the interaction of a 
substrate or inhibitor with an enzyme, the interaction of 

5 a ligand with a receptor, and the interaction of an 
antigen or antigenic epitope with an antibody. 

8. A method according to claim 6 wherein the active 
moiety is selected from the group consisting of the 
following enzyme classes: 

10 1 . Oxidoreductase 

a ) dehydrogenase 

b) oxidase 

c) peroxidase 

d) catalase 

is 2 . Hydrolase 

f) peptidase (proteolytic enzyme) 

3 . Transferase 

g) aminotransferase 

h) kinase 

20 i) glucosyl transferase 

4 . Lyase 

j ) decarboxylase 
k) dehydratease 
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5 . Isomerase 
1) racemase 
m) mutase 

6 . Ligase 

n) synthetase 
o) carboxylase 

9. A method according to claim 8 wherein the active 
moiety is a protease selected from the group consisting of 
the following: 

1. Aspartyl proteases, such as renin, HIV, cathepsin 
D and cathespin E. 

2. Metalloproteases, such as ECE, gelatinase A and B, 
collagenases , stromolysins . 

3. Cysteinyl proteases, such as apopain, ICI, DerPI, 
cathepsin B, cathepsin K. 

4. Serine proteases, such as thrombin, factor Vila, 
factor Xa, elastase, trypsin. 

5. Threonyl proteases, such as proteasome S. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/42216 



PCT/GB97/01158 



- 78 - 

10. A method according to claim 3 wherein the active 
moiety is a kinase and the interaction is with a kinase 
substrate or . inhibitor . 

11. A method according to claim 3 wherein the active 

5 moiety is a receptor and the interaction is with a ligand. 

12 . A method according to claim 3 wherein the active 
moiety is an antibody and the interaction is with an 
antigen or antigenic epitope. 

13 . A method according to claim 3 wherein the mixture of 
10 compounds in all of the wells of primary library Ll 

contain all variations of a first two variable groups and 
each well contains a unique pair of the two other variable 
groups; and wherein the mixture of compounds in all of the 
wells of secondary library L2 contain all variations of 
15 the two other variable groups and each well contains a 
unique pair of the first two variable groups. 

14 . A method according to claim 13 wherein the primary 
library contains all variations of variable groups B and 
E, and unique pairs of variable groups C and D; and the 

20 secondary library contains all variations of variable 

groups C and D, and unique pairs of variable groups B and 
E. 

15. A method according to claim 3 or 4 which comprises 
the steps of forming libraries according to claim 1 or 2, 
25 placing the libraries into the wells of well plates in a 
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format according to claim 3 or 4, applying an active 
moiety to the wells, screening for an interaction of the 
active moiety with one or more compounds of a mixture in a 
well of primary library LI and secondary library L2 and 
deducing from the respective LI and L2 well positions a 
unique active compound formula. 
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Figure 1 
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First Example of Library Matrix where n=l 
Component Distribution in Plate 1, Library 1 (n=rl) 
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Figure 2 
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Component Location in Plate 1, Library 2 (n=l) . 
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Figure 3 
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Component Location in Plate 2, Library 2 (n=l) 
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Figure 4 
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Component Location in Plate 3, Library 2 (n=l) . 
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Figure 5 
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Component Location in Plate 4, Library 2 (n=l) . 
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Example Library where n=4 

Component Distribution in Plate 1, Library 1 (n=4) 
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Figure 7 
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Component Location in Plate 2, Library 1 (n*4) 
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Component Location in Plate 3, Library 1 (n=4) 
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Component Location in Plate 4, Library 1 (n=4) . 
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